119 results found.
Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
Freely Available
License:
Open Source
Size:
26,724 tokens, 7,503 sentences sentences Production Status:
Newly created-in progress
Use:
Text Mining
-
Paper title:An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Eshrag Refaee | Heriot-Watt University | GB |
| Author 2 | Verena Rieser | Heriot-Watt University | GB |
| Main Contact | Eshrag Refaee | Heriot-Watt University | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English Standard Arabic
Availability:
Freely Available
License:
OpenSource
Size:
100.1 MByte Production Status:
Existing-updated
Use:
Corpus Creation/Annotation
-
Paper title:OSMAN – A Novel Arabic Readability Metric
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Mahmoud El-Haj | Lancaster University | GB |
| Author 2 | Paul Rayson | Lancaster University | GB |
| Main Contact | Mahmoud El-Haj | Lancaster University | None |
Documentation:
<Not Specified>
Written
Annotation Tool,
Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
Freely Available
License:
GNU
Size:
10 <Not Specified>Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Arabic-Segmentation Combination Strategies for Statistical Machine Translation
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Saab Mansour | RWTH Aachen University | None | ||||
| Author 2 | Hermann Ney | RWTH Aachen University | DE | RWTH Aachen University | None | RWTH Aachen | DE |
| Main Contact | Saab Mansour | RWTH Aachen University | DE |
Documentation:
http://www-i6.informatik.rwth-aachen.de/~mansour/MorphSegmenter/Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
From Owner
License:
CreativeCommons
Size:
2000000 words Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Wajdi Zaghouani | Carnegie Mellon University | QA | ||
| Author 2 | Nizar Habash | New York University Abu Dhabi | US | Columbia University | AE |
| Author 3 | Ossama Obeid | Carnegie Mellon University in Qatar | QA | ||
| Author 4 | Behrang Mohit | Ask.com | US | ||
| Author 5 | Houda Bouamor | Carnegie Mellon University | QA | ||
| Author 6 | Kemal Oflazer | Carnegie Mellon University - Qatar | QA | ||
| Main Contact | Wajdi Zaghouani | Carnegie Mellon University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
From Owner
License:
<Not Specified>
Size:
2M words Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Applying the Cognitive Machine Translation Evaluation Approach to Arabic
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Irina Temnikova | Qatar Computing Research Institute | BG | Qatar Computing Research Institute, HBKU | QA |
| Author 2 | Wajdi Zaghouani | Carnegie Mellon University | QA | ||
| Author 3 | Stephan Vogel | Qatar Computing Research Institute | QA | ||
| Author 4 | Nizar Habash | New York University Abu Dhabi | US | ||
| Main Contact | Irina Temnikova | Qatar Computing Research Institute, HBKU | None | Sofia University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
From Owner
License:
OpenSource
Size:
77430 words Production Status:
Existing-updated
Use:
Language Modelling
-
Paper title:Automatically generated, phonemic Arabic-IPA pronunciation tiers for the Boundary Annotated Qur'an Dataset for Machine Learning (version 2.0)
-
Paper track:<Not Specified>
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Majdi Sawalha | The University of Jordan | JO |
| Author 2 | Claire Brierley | University of Leeds | GB |
| Author 3 | Eric Atwell | University of Leeds | GB |
| Main Contact | Majdi Sawalha | The University of Jordan | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
Not Applicable
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
Emotion Recognition/Generation
-
Paper title:AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Muhammad Abdul-Mageed | <Not Specified> | None |
| Author 2 | Mona Diab | Columbia University | None |
| Main Contact | Muhammad Abdul-Mageed | Indiana University, Bloomington | US |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
Freely Available
License:
ELRA
Size:
20151 tweets OtherProduction Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Towards a Corpus of Violence Acts in Arabic Social Media
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Ayman Alhelbawy | Essex University | GB | ||
| Author 2 | Poesio Massimo | Essex University | GB | ||
| Author 3 | Udo Kruschwitz | University of Essex | GB | Essex University | GB |
| Main Contact | Ayman Alhelbawy | Essex University | None |
Documentation:
<Not Specified>
Written
Software Toolkit,
Language Type:
Multilingual
Languages:
English Standard Arabic
Availability:
Freely Available
License:
OpenSource
Size:
150 MByte Production Status:
Newly created-finished
Use:
Discourse
-
Paper title:OSMAN – A Novel Arabic Readability Metric
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Mahmoud El-Haj | Lancaster University | GB |
| Author 2 | Paul Rayson | Lancaster University | GB |
| Main Contact | Mahmoud El-Haj | Lancaster University | None |
Documentation:
OSMAN Arabic Readability Metrics
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English Standard Arabic
Availability:
From Data Center(s)
License:
SCOLA
Size:
see description OtherProduction Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Claire Jaja | <Not Specified> | None |
| Author 2 | Douglas Briesch | <Not Specified> | None |
| Author 3 | Jamal Laoudi | <Not Specified> | None |
| Author 4 | Clare Voss | <Not Specified> | None |
| Main Contact | Claire Jaja | Army Research Lab (ARL), Advanced Resources Technologies Inc (ARTI) | US |
Documentation:
See the SCOLA website for English documentation




